Picture for Jie Zhang

Jie Zhang

Chongqing Jinshan Science & Technology

Learning to Inject: Automated Prompt Injection via Reinforcement Learning

Add code
Feb 05, 2026
Viaarxiv icon

Unifying Watermarking via Dimension-Aware Mapping

Add code
Feb 03, 2026
Viaarxiv icon

EMFormer: Efficient Multi-Scale Transformer for Accumulative Context Weather Forecasting

Add code
Feb 01, 2026
Viaarxiv icon

DECEIVE-AFC: Adversarial Claim Attacks against Search-Enabled LLM-based Fact-Checking Systems

Add code
Jan 31, 2026
Viaarxiv icon

When Classes Evolve: A Benchmark and Framework for Stage-Aware Class-Incremental Learning

Add code
Jan 31, 2026
Viaarxiv icon

Character as a Latent Variable in Large Language Models: A Mechanistic Account of Emergent Misalignment and Conditional Safety Failures

Add code
Jan 30, 2026
Viaarxiv icon

Contrastive Spectral Rectification: Test-Time Defense towards Zero-shot Adversarial Robustness of CLIP

Add code
Jan 27, 2026
Viaarxiv icon

RvB: Automating AI System Hardening via Iterative Red-Blue Games

Add code
Jan 27, 2026
Viaarxiv icon

GUIGuard: Toward a General Framework for Privacy-Preserving GUI Agents

Add code
Jan 26, 2026
Viaarxiv icon

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Add code
Jan 26, 2026
Viaarxiv icon